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BACKGROUND OF THE INVENTION 

Field of the Invention 

The invention relates generally to the field of electronic design automation 
(EDA), and particularly to a system and method for generically inferring sequential 
logic elements to generate a netlist for the desired circuit. 

Description of the Related Field 

EDA generally refers to a powerful process that allows designers to describe 
a desired digital circuit on an abstract, functional level, the Register Transfer Level 
(RTL) using Hardware Description Language (HDL). The functionality of the design 
can then be verified mathematically through circuit simulation. Logic synthesis 
automatically converts the RTL description to a gate-level netlist. Subsequent 
steps parse the netlist to place the gates on the chip floor. Finally, a routing 
process determines the physical layout necessary to create the requisite 
connections between the gates. From the physical layout, the silicon for the 
resulting integrated circuit can be fabricated. 

The logic synthesis step requires both the RTL expression of the desired 
circuit as well as a library of predefined sequential cells, or blocks of logic 
functionality. A sequential inferrer assembles sequential cells into a netlist of 
interconnected cells that have the functionality of the RTL description. 
Specifically, a sequential cell server provides the process with the specific cells 
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necessary to achieve the desired functionality. The server either obtains the 
requested cell directly from the library or, more significantly, it generates the 
requested cell by combining one or more functions from the basic cells in the 
library. When an inferred cell is necessary, the cell server compares the 
component parts of the requested cell against the library of available cells. The 
cells can be broken down into output portions, input portions, clock portions, and 
asynchronous portions. Candidate cells are selected from the library that match 
the output, clock and asynchronous portions of the requested cell. Then, the 
server determines whether each candidate cell's input function can be transformed 
via inhibition, transformation or combinational inference to match that of the 
requested cell. When an appropriate candidate is selected, the cell can be 
transformed and incorporated into the netlist. Typically, more than one candidate 
cell can be transformed to achieve the desired functionality. Therefore, the 
successful candidate cells can be further screened to optimize other necessary 
criteria such as size, power consumption, signal integrity, routing constraints and 
the like. 

An example of prior art logic synthesis involves the use of full rule-based 
transformation. As discussed above, the transformation process focuses on the 
data input portion of a sequential cell. Data input functions lie between the data 
input terminals and the clock mediated flipflop or latch. Thus, for the purposes of 
this invention, data input functions are always synchronous and the functions 
comprise the set of combinational elements that operate on the data input. These 
elements can either be simply a wire or a gate, including an AND gate (for a reset 
function), an OR gate (for a set function), a multiplexer ("MUX" - for selecting), or 
an inverter (to toggle). The elements are assigned a position based on their 
relation between the data input and the flipflop such that the closest element to 
the flipflop is the first position. 



The nature of the elements allows some reduction in the possibility of 
combinations. For example, the Toggle and MUX elements have forbidden 
positions (they must be set in the last position of the function), and the Toggle and 
MUX elements are mutually exclusive (they cannot be present in the same 
function). It is possible to simplify the process somewhat by constraining the 
number of elements that may be used. For example, allowing each element to be 
inferred only once results in 258 possible combinations without either a MUX or 
Toggle element, 258 combinations with a Toggle element and 170 combinations 
with a MUX element, for a total of 686 different functions. Accommodating each 
of these possible combinations with a full rule-based mechanism requires a total of 
470,596 transformation rules (686 possible requested cells multiplied by 686 
possible candidate cells). Allowing the elements to appear more than once 
exacerbates the problem. For example, allowing up to five positions results in 
more than 3000 functions and correspondingly requires more than one million 
rules. As exemplified below, this available system can accommodate simple 
transformations, but more complex functions illustrate the limitations of the 
system. 

In a first example, the requested cell has a data input function of Reset 
active high. To apply the full rule-based system, each candidate is screened to 
determine if it has a Reset element. If so, then the inferrer must confirm the 
polarity on the synchronous reset terminal and inhibit (replace with a simple 
connection) any other elements, such as Scan, ScanMux or Set. If the candidate 
does not contain a Reset element, then a Scan element can be set to a constant to 
transform it, the MUX and Set elements can be inhibited, or if no Scan is present a 
synchronous Reset can be inferred. Since only a few rules are necessary for this 
example, the rule based transformation can be adequate for relatively simple 
functions such as Set, Reset or Scan. 



A requested cell having an active high synchronous Set Reset function 
presents a more complicated example. For a simple flipflop candidate cell, the full 
rule based system requires the addition of appropriate AND and OR gates. If the 
candidate is an active low Scan flipflop, the active low Scan must be transformed 
5 into a Reset and an OR gate must be added. On the other hand, if the candidate is 
an active high ScanMux flipflop, the active high Scan is transformed into a Reset 
and the MUX is transformed into Set. Finally, if the candidate is an active high 
Reset high recirculating flipflop, the Reset is kept, the recirculating element is 
inhibited and an OR gate is added. This example demonstrates that a specific rule 
10 is required for each type of candidate cell depending on parameters such as 

p whether a Scan or ScanMux is available. Accordingly, the number of possible 

yQ 

jg combinations discussed above indicates that the full rule-based system is 

ffj essentially unmanageable for more sophisticated data input functions such as the 

O Scan Set Reset function. 

55 
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Another prior art sequential inferrer utilizes a Boolean matching system. In 
S. Krishnamoorthy, F. Maihlot "Boolean matching of sequential elements", 31st 
Design Automation Conference, 1994, the sequential cells are analogized to 
boolean expressions allowing the requested cell to be screened against the library. 
20 This teaching is hereby incorporated in its entirety by reference thereto for its 
disclosure of sequential inference and logic synthesis. However, this system 
works only if the requested cell finds an exact match in the library. It does not 
provide a means for transforming the basic cells of the library into requested cells 
that do not match. Further, the system does not maximize the number of 
25 candidates since only matches are identified, not candidates that could be 

transformed into matches. As discussed above, it is important to identify as many 
candidates as possible to permit optimization of other criteria. 
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Thus, there is a need for a sequential inferrer having a cell server capable of 
efficiently identifying candidate cells that may be transformed into a requested cell 
There is also a need for an inferrer capable of transforming sophisticated logic 
functions and cells comprising multiple elements. Similarly, there is a need for an 
5 inferrer that maximizes the number of suitable candidate cells from a given library. 
Additionally, there is a need for a sequential inferrer that permits accurate cost 
estimation of the resulting circuit. There is also a need for an inferrer that uses a 
minimum number of rules to accommodate the various transformations necessary 
to convert a candidate cell into a requested cell. This invention satisfies these and 
10 other needs. 
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SUMMARY OF THE INVENTION 

The present invention comprises a method for inferring a requested cell from 
a library of candidate cells during the generation of a netlist from the RTL 
description of a circuit. The method generally comprises representing the 
5 requested cell and the candidate cell as a mathematical expressions; performing an 
operation on the requested cell representation with the candidate cell 
representation to return at least one value; providing a rule corresponding to each 
returned value; and transforming the candidate cell into the requested cell by 
performing each rule corresponding to each returned value. 

10 

3 Preferably, the candidate cell and requested cell are expressed as 

rT polynomials comprising multinoms representing the functional synchronous 
y elements. Further, division of the requested cell polynomial by the candidate cell 
H polynomial returns inhibition, transformation and inference polynomials. A rule 
J5 corresponding to each multinom of the inhibition, transformation and inference 

polynomials is then applied to the candidate cell to give it the functionality of the 
*~ requested cell. 

vS The multinoms from which the polynomial expression of the invention are 

derived may be classified as either major or minor, where a minor monom can 

20 divide 1 but a major monom cannot. A given polynomial representation is divisible 
by another if all the major monoms present in the divisor are also present in the 
dividend. 

A table of rules for all the possible values of the inhibition, transformation 
25 and inference polynomials can easily be determined. Generally, the number of 
rules required is very manageable and represents a significant advantage over 
available systems that employ a full-rule based inference process. Preferably, the 
method is implemented in a computer to facilitate the EDA process. 
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BRIEF DESCRIPTION OF THE DRAWINGS 



The aforementioned advantages of the invention, as well as additional 
advantages thereof, will be more fully understood as a result of a detailed 
description of a preferred embodiment when taken in conjunction with the 
accompanying drawings in which: 

FIG. 1 is a schematic representation of an active high synchronous Set Reset 

cell; 

FIG. 2 is a schematic representation of a flipflop with active low ScanMux; 

FIG. 3 is a schematic representation of a Toggle flipflop; 

FIG. 4 is a flow chart showing the process of the invention; 

FIG. 5 is a table of Inhibition rules to apply based on the results of the 
process; 

FIG. 6 is a table of Transformation rules to apply based on the results of the 
process; 

FIG. 7 is a table of Inference rules to apply based on the results of the 
process; 

FIG. 8 is table showing the possible combinations of multinom divisions; 

FIG. 9 is a schematic representation of a D input flipflop candidate cell; 

FIG. 10 is a schematic representation of an active low Scan flipflop 
candidate cell; and 

FIG. 1 1 is a schematic representation on an active high Reset active high 
Recirculating candidate cell. 
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DETAILED DESCRIPTION OF THE INVENTION 



Definitions 

Sequential cell - a f lipf lop or latch having one or more synchronous functions 
interposed between the data input terminals and the flipflop or latch and may also 
5 comprise asynchronous functions; 

Data input portion - the set of sequential cell terminals connected to functions 
controlled by the clock, i.e. the synchronous functions; 

Asynchronous portion - the set of sequential cell terminals connected to functions 
not controlled by the clock and having immediate effect on the outputs, such as 
10 clear and preset; 

jg Output portion - the set of sequential cell terminals that return the values produced 
J by the synchronous functions and stored in the flipflop, such as Q, Qbar and Qz; 
^ Clock portion - the cell input terminals carrying the clock signal to trigger the 
M= flipflop, such as Clock and ClockEnable; 

J5 Element - a synchronous subfunction, including Set, Reset, Mux, Scan, Recirc, and 
J? Toggle, implemented with AND, OR, MUX or INV; 

□ Function - one or more elements connected to the data input terminals that provide 
y3 the requested functionality; 

w Position - the order of connections of elements between the data input terminals 
20 and the flipflop, wherein the element adjacent the flipflop has the first position; 

Library - an existing set of sequential cells directly available for the inference 

process; 

Requested cell - a sequential cell determined by the inferrer during translation of 
the RTL; 

25 Candidate cells - available cells in the library that potentially match the 
functionality of the requested cell after manipulation such as inhibition, 
transformation or combinational inference; 

Sequential cell server - a tool that generates cells requested by the inferrer by 
interconnecting the library cells, wherein the server either directly matches a cell 
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from the library to the requested cell or builds the requested cell by manipulating a 
library cell; 

Transformation mechanism - the process by which a candidate cell is converted 
into a requested cell, including the specific inhibition, transformation and inference 
5 operations necessary to apply to a candidate cell to give it the requested 
functionality, ; 

Inhibition - the manipulation of a candidate cell wherein a given element is replaced 
by a connection; 

Transformation - the manipulation of a candidate cell wherein a given element is 
10 changed to another element; 

D Combinational inference - the manipulation of a candidate cell wherein a given 
J element is added; 

D Methodology 

jlJ5 The invention provides a method for converting identified candidate cells to 

generate the functionality of a requested cell. The requested cell is converted to a 
polynomial expression and then divided by the polynomial expression of the 

Zz 4 candidate cell. The result of the division dictates the necessary inhibition, 

yrj transformation and inference steps to convert the candidate cell into the requested 

20 cell. 

As described below, the concepts of polynomial theory may be applied to 
sequential cells. A polynomial comprises the sum of monoms taken to varying 
degrees. For example, in the polynomial P = X*3 + 4X + 7, X is a monom of 
25 degree 3, 1 and 0 respectively. Multinoms are the product of different monoms, 
all of which have a degree of 1 , 0 or -1 . A monom may be considered a 
degenerated multinom. In the set of reals, only 1 is a multinom since X^O = 1. 
These principles are demonstrated in the following examples: 
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X"1 = X 
X A 0 = 1 

XM = 1/X 

5 If X, Y and W are monoms, then 

XYW is a multinom, 

YW is a multinom, 

X is a multinom, 

W(X A -1) is a multinom, 
10 1 is a multinom, 
n (Y A 2)W is not a multinom, 
% (X A 3)(Y A -3) is not a multinom, and 
fj; 4 is not a multinom. 

j3N5 Monoms may be further classified as major or minor. Major monoms cannot 

J\ divide 1, while minor monoms can: 1/X = X A -1. 

SJ There are two primary rules that govern division of polynomial expression of 

5 sequential cells. First, if Ztype and Zref are multinoms, then Ztype is divisible by 
20 Zref if the major monoms of Zref are also present in Ztype. For example: 

If S, 0, R, Re and T are major monoms, and X, Y and N are minor monoms, then 

SOR is divisible by R, 
25 SOR is divisible by OY, 
Re is divisible by ReN, 
SORXY is not divisible by ReN, 
0 is not divisible by SOR, and 
T is not divisible by RX. 
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Second, prior to the division, the degree of the multinom is set to 1 . Thus: 
((SOR) A 3)/(OY) = SOR/OY = SR/Y 
((SORXY) A 3)/(SOR) A 2) = SORXY/SOR = XY 

The above rules and definitions regarding polynomials can be applied to 
sequential data input functions. For a given cell, each element is a multinom 
having a degree corresponding to the element's position in the cell. Using the 
major monoms Rst, St, Sc, Re and T and the minor monoms Lr, Ls, Mu and Lre. 
Accordingly, each element can be described with the following multinoms: 

Rst: Reset active high 
RstLr: Reset active low 
St: Set active high 
StLs: Set active low 
ScStRst: Scan active high 
ScStLsRstLr: Scan active low 
MuScStRst: Mux after Scan 
Re: Recirculating active high 
ReLre: Recirculating active low 
T: Toggle 

For example, Fig. 1 is a schematic representation of a sequential cell 10 
comprising a flipflop 12 with an active high synchronous Reset 14 and Set 16. 
The Reset 14 is adjacent to the flipflop 12 so it has position (and degree) one. 
Similarly, the Set 16 element is in position 2. Accordingly, the polynom 
representing cell 10 is Rst + St A 2. Fig 2 is a schematic representation of a 
ScanMux flipflop active low 18, that is, a flipflop 20 with a Scan active low 22 
and a Mux 24, and thus its polynomial representation is ScStLsRstLr + 
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(MuScStRstp2. Fig. 3 is a schematic representation of a Toggle flipflop 26. The 
Toggle is a wire 28 that loops inverted Q output back to the input. The polynomial 
for Toggle flipflop 26 is T. 

5 The transformation process of the invention is summarized in the flowchart 

shown in Fig. 4. The following abbreviations are used: 

Preq is the polynom of the requested cell and Zreq is a multinom portion of Preq; 
Pcand is the polynom of the candidate cell and Zcand is a multinom portion of 
Pcand; 

10 Pinhib is the polynom representing the elements to be inhibited on the candidate 
p cell as detailed in Fig. 5; 

jE Ptransform is the polynom representing the elements to modify by adding an 
f2 inverter or stocking an input terminal as detailed in Fig. 6; and 
□ Pinfer represents the elements to infer on the candidate cell data input terminal as 
rt15 detailed in Fig. 7. 

y The process begins at step 30, wherein the values of Pinhib, Ptransform and 

M Pinfer are set to 0. Next, in step 32, if either Pcand or Preq is equal to 0, then 
j3 step 34 is performed. If neither Pcand nor Preq is equal to 0, then step 36 is 
20 performed, wherein Zreq is set to equal the multinom of the smallest degree in 
Preq and Zcand is set to equal the multinom of the smallest degree in Pcand, that 
is also divisible by Zreq. Step 38 queries whether a suitable Zcand is found, that 
is, if Pcand has a multinom divisible by Zreq. If so, division subroutine step 40 
adds all multinoms of Pcand that are smaller than Zcand to Pinhib, removes Zcand 
25 from Pcand, removes Zreq from Preq and adds the result of the Zcand/Zreq division 
to Ptransform. The process now loops back to step 32 to check whether all the 
multinoms of Pcand and Preq have been addressed. From step 38, if Pcand does 
not have a multinom divisible by Zreq, step 42 sets Pinhib equal to Pcand and then 
sets Pcand to 0. After step 42, the process also loops back to step 38. Step 34 
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is performed when either Pcand or Preq initially equals 0 or has been reduced to 0 
by previous divison subroutine manipulation. In step 34, if Preq equals 0, then 
Pcand is added to Pinhib and if Pcand = 0 then Preq is added to Pinfer. 

5 The division subroutine 40 follows the general rules established above for 

polynomial division. The Sc, Rst, St, Re and T monoms are major monoms and 
must be present in Zcand if they are found in Zreq. For example: 
ScStRst is divisible by Rst (= ScSt); 
ScStRst is divisible by StY (= (ScRst)/Y); 
10 Re is divisble by ReLre (= Lre A -1); 

p ScStRstLrLs is not divisible by ReLre (Zcand does not have the major monom Re); 
St is not divisible by ScStRst (Zcand does not have the major monoms Sc and Rst); 

^ and 

\ t% 

O T is not divisble by RstLr (Zcand does not have the major monom Rst). 
gj5 Further, the degree of the multinoms Zreq and Zcand is set to 1 for the division. 
For example: 

2 ((ScStRst) A 3)/(StLs) = ScStRst/StLs = ScRst/Ls. 

SJ 

yrf The process of Fig. 4 returns three polynomials, Pinhib, Ptransform and 

20 Pinfer for a given Preq and Pcand. The candidate cell is thus subjected to an 
inhibition step, a transformation step and an inference step. Together, the 
inhibition, transformation and inference steps are considered the transformation of 
the candidate cell into the requested cell. First, for each multinom of Pinhib, the 
corresponding rule shown in Fig. 5 is applied to the candidate cell. Second, the 
25 transformation rule base shown in Fig. 6 is applied as dictated by Ptransform. 

Finally, the Pinfer multinom determines the elements to add to the candidate cell as 
shown in Fig. 7. The degrees of the Pinfer multinoms must be conserved so that 
the multinom of the lowest degree must be inferred first and placed adjacent the 
flipflop or latch. In other words, as described above, the degree of the Zcand and 
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Zreq leading to the Pinfer will determine where the inferred element belongs in the 
latch in relation to any other elements. 

Fig. 8 is a table illustrating all the possible combinations of the multinom 
5 divisions discussed above. The columns are the divisor while the rows are the 
dividends. The table demonstrates that the transformation rule base shown in Fig. 
6 accommodates all the possible division results. Accordingly, the inventive 
process offers significant advantages over prior art full rule based transformation 
logic synthesis. In the above examples, only 41 rules are necessary to perform all 
10 the transformation for all combinations of candidate cells and requested cells. In 
□ contrast, the prior art rule based systems require 470,596 rules for a comparable 
1? process. Thus, the inventive system is significantly more efficient. As a practical 

matter, a greater number of candidate cells can be generated due to the more 
O efficient transformation, allowing greater flexibility in optimizing the netlist for area, 
fl;5 power consumption, routing issues and the like. 

^ Examples 

'^J For the first example, the requested cell is an active high Reset Set flipflop 

yo as shown in Fig. 1 . The process of the invention may be compared to the prior art 
20 example discussed above in the background. The polynomial representation is 
Preq = Rst + St^2. If the candidate cell is a simple D flipflop, shown in Fig. 9, 
then Pcand = 0. Step 32 thus leads directly to step 34 where, since Pcand = 0, 
Preq is added to Pinfer. Pinhib and Ptransform maintain their starting values of 0. 
Accordingly, nothing is inhibited or transformed and Rst + St^2 is inferred. Since 
25 the Rst multinom has the lowest degree, it is placed in the first position and the 
Set element is placed in the second position. 

Alternatively, if the candidate cell is an active low scan flipflop as shown in 
Fig. 10, Pcand = ScStLsRstLr. The process is followed to step 36 where Zreq is 
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set to Rst and Zcand is set to ScStLsRstLr. Zcand is divisible by Zreq, so step 40 
sets Pinhib to the Pcand multinoms of smaller degree than Zcand, here 0. Step 40 
further subtracts Zcand from Pcand, thus setting Pcand to 0, subtracts Zreq from 
Preq, thus setting Preq to St A 2, and sets Ptransform to Zcand/Zreq, i.e. ScStLsLr. 
5 The process is then looped back to step 32, where the Pcand value of 0 then 
routes the process to step 34. Since Pcand is 0, Pinfer is set to Preq which now 
has the value of St^2. Thus, the result of the process is Pinhib = 0, Ptransform = 
ScStLsLr and Pinfer = St*2. From Figs. 5-7, the corresponding operations 
performed on the candidate cell is to inhibit nothing, set Tl to Vss and connect TE 

10 to the Reset terminal with an inverter (converting the Scan element to a Reset 

P element with the correct polarity) and infer a Set. 

fz For the final example, the candidate cell is a flipflop with an active high 

B Reset and an active high Recirculating as shown in Fig. 1 1. Thus, Preq is still Rst 
p + St"2 and Pcand is Rst + Re^2. Step 36 sets Zreq to Rst and Zcand to Rst. 

Since the division is possible, step 40 adds 0 to Pinhib as there are no multinoms 
D of smaller degree in Pcand. Next, Rst is subtracted from Pcand leaving Re^2 and 
^4 Rst is subtracted from Preq leaving St A 2. Finally, Ptransform is given a value of 1, 
T| for the value of Zreq/Zcand. The process is looped back to step 32 and then to 36 
20 as neither Pcand nor Preq is 0 yet. At step 36, Zreq is set to St A 2 and Zcand is 
set to Re A 2. Step 38 routes the process to step 42 as Zcand is not divisible by 
Zreq. Then Pcand (Re^2) is added to Pinhib and Pcand is set to 0. The process is 
looped back to step 32 and then to 34. Since Pcand is now 0, Preq is added to 
Pinfer giving it a value of St A 2. Thus, Pinhib has the value Re A 2, Ptransform has 
25 the value 1 and Pinfer has the value St^2. Thus, from Figs. 5-7, the Recirculating 
active high is inhibited, nothing is transformed and a Set is inferred. 



The inventive process can be easily adapted to accommodate additional 
elements. A dedicated multinom is added for the new element and a few 
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additional rules are added to the inhibition, transformation and inference rule bases, 
The process remains unchanged. For example, a JK element can be incorporated 
by adding only 10 extra rules to the inhibition, transformation and inference rule 
bases. In contrast, inclusion of a JK element in the prior art rule based approach 
would multiply the number of required rules by a factor of four. Further, the 
process can be optimized if necessary. Certain elements can be further described 
by additional monoms. For example, a monom can be used to indicate that a Scan 
element is in the last position, allowing it to be considered a Mux. 

Described herein is a preferred embodiment, however, one skilled in the art 
that pertains to the present invention will understand that there are equivalent 
alternative embodiments. 
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